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ABSTRACT: 

Let n = b\ + • • • + bk = b[ + • + b' k be a pair of compositions of n into k positive parts. We 

say this pair is irreducible if there is no positive j < k for which b± -\ bj = b\ H The 

probability that a random pair of compositions of n is irreducible is shown to be asymptotic 
to 8/n. This problem leads to a problem in probability theory. Two players move along a 
game board by rolling a die, and we ask when the two players will first coincide. A natural 
extension is to show that the probability of a first return to the origin at time n for any 
mean-zero variance V random walk is asymptotic to \JV j (2ir)n~ 3 / 2 . We prove this via two 
methods, one analytic and one probabilistic. 
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1 Introduction 



By a composition of n into k parts we mean an ordered representation of n as a sum of k 
positive integers. Let C and C denote respectively the compositions n = b\ + • • • + b^ and 
n = b'i + • • • + b', of n into k parts. We'll say that C,C are an irreducible pair if for every 
j = 1, 2, . . . , k — 1 we have b\ + • • • + bj ^ b[ + • • • + b'j, while, of course, equality holds at 
j = k. Note that we allow b\ + . . . + bi = b' x + ■ — h bj for i / j. 

Our starting point for this note is the following. Let f(n) denote the number of irre- 
ducible ordered pairs of compositions of n into the same number of parts. 



Theorem 1.1 

= _?L . (i.i) 

T/iis gives a combinatorial interpretation to sequence A081696 of Shane's database. Fur- 
thermore, if we let p n denote the probability that a pair of compositions is irreducible when 
chosen uniformly at random from among all pairs of n compositions into an equal number 
of parts, then 

f(n) ~ ^n^e- (1.2) 



Vn ~ -■ (1-3) 

n 

Remark: A somewhat similar problem about integer partitions was studied by Erdos et 
al |ENH92j. 

The asymptotics in (|1.2j) and Ijl.HJI are derived from the exact computation (jl.lj) . We 
compare this to another well known paradigm for analyzing compositions, namely pois- 
sonization. It is well known that a uniform random composition of n may be gener- 
ated by letting {Yj : j > 1} be independent random variables whose distribution is ge- 
ometric with mean 2, that is, P(Yj' = i) = 2~ l . Let Wk '■= Sj=i^j denote the par- 
tial sums. If T := min{/c : Wk > n} is the first time the partial sums exceed n, then 
Y\ + - ■ ■ + Yt—i + (n — Yt-i) is uniformly distributed over compositions of n. The number of 
parts of the composition is T, which is asymptotically normal with mean n/2 and standard 
deviation ®{^Jn). Conditioning on T = k gives the uniform distribution on compositions of 
n into k parts. 
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From this viewpoint, a pair of compositions with the same number of parts is just 
a pair of independent random walk sequences {Yj} and {Yj}, conditioned to have the 
same stopping time T = T' . Irreducibility of the pair corresponds to Wj ^ Wj for all 
1 < 3 < T - 1. Let S k := W k -W' k = Ylj=i( Y j ~ Y j) ■= Ej=i X 3 be the Partial sums of 
the difference sequence {Aj}. Then irreducibility corresponds to r = n, where t is the first 
return time, that is, r = min{/c > 1 : S\~ = 0}. 

A rigorous proof of ()1.3|) via analysis of the return time of {S n } to the origin would 
require, among other things, showing that conditioning on T = T' does not significantly 
affect the distribution of the return time. This would be far messier than the compact 
proof of Theorem 11.11 lo elow . Nevertheless, the poissonization paradigm raises the question 
of the distribution of the return time of {S n } to the origin. The same question arises in a 
game similar to Parcheesi with only one token per player. Here, the two players each roll 
a die and (simultaneously) advance their token the number of positions shown on the die. 
When the tokens collide, they must both go back to start. The chance of the first collision 
occurring at time n is just a n := P(r = n). Our main result is the following asymptotic for 
a n : 

Theorem 1.2 Let {Xj : j > 1} be independent with mean zero, finite variance, V , and no 
periodicity (that is, the GCD of times n at which it is possible to have S n = 1 is 1). Let 
{Sn}, t and a n be as above. Then the probability a n of the first return to occurring at 
time n is asymptotically given by 




Surprisingly, given the wealth of knowledge about random walks, we were unable to find 
this theorem in the literature. The formula is not surprising, and is what one obtains in a 
thumbnail calculation by "differentiating" with respect to n estimates such as Ij3.9|) below, 
which is an estimate for Q n := F(Sj ^ 0, VI < j < n). There are special cases, such as 
the simple random walk where S n = ±1 according to a fair coin-flip, in which a n is easy 
to compute exactly. Asymptotics in the general case are well known for many quantities 
such as Q n and a' n := P(5' n = 0), but we could find no text that included asymptotics for 
a n and indeed these seem tricky to obtain by probabilistic methods; a probabilistic proof 
of Theorem 11.21 is the subject of the last section of this note. 

In the remainder of this section, we prove Theorem ll.il In the subsequent section we 
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prove Theorem 11.21 by analytic means. In the final section, we give a probabilistic proof of 
Theorem O 



Proof of Theorem II. It Let f(n,k) be the number of irreducible ordered pairs of com- 
positions of n into k parts. We will show that 



Exy + x 2 (l - y) 2 - 2x(l + y) - xy 
f(n,k)x n y k = ^ ; 57 r '- (1.4) 
. ., ' l-2x(l+y)+x 2 (l-2y) V ; 

from which (|l,lj) follows by setting y = 1. 

To show 1)1.4(1 . by considering the number of ordered pairs of compositions of n into 
k parts such that the partial sums of the parts agree with each other at indices k\ , k\ + 
&2 , . . . , &i + . . . + k r , we see that 

X X f(mi,k 1 )f(m 2 ,k 2 ) . . . f(m r ,k r ) = K _ , 

y*^>\ Wli +• ■ ■-\-Tn r — n ^ ' 

fel +. ..-\-k r = k 

the right side being the total number of pairs of compositions of n into k parts. Hence if 
F (x, y) = En,fc>i Z( n ' k)x n y k , we have 

2 



1-F ^ U-l 

n,fc>l v 



x v / , \ k \ x y 



2 



i,fc>0 



xy 

n>0 



xy 



v / l-2x(l + j/) + x 2 (l-y) 2 ' 

in which the P n 's are the Legendre polynomials. The claimed result ()1. 4(1 now follows by 
solving for i* 1 . 

The estimate 1)1-2(1 follows from 1)1-1)1 via standard Tauberian theorems. The result of 
Flajolet and Odlyzko, for instance (Theorem 12. II quoted below) suffices, although ((1.2)1 may 
also be obtained by the method of Darboux which requires a smaller region of analyticity. 
Since there are 

^— ' V k - 1) V n - 1 



n-l\ 2 /2n-2 x An ~ l 



n7r 
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ordered pairs of compositions of n with the same number of parts, it follows that the 
probability that a random pair of compositions of n with the same number of parts is 
irreducible is ~ 8/n. □ 



2 Analytic proof 



Let H (z) := Yl n >l a nZ n be the generating function for the probabilities a n of first return 
at time n. Let G(z) := J2 n >o a n zn '■> where a' n = F(S n = 0) is the probability of a return to 
the origin at time n but not necessarily the first return (set a' = 1 and ao = 0). Then G 
and H are analytic on the open unit disk and G = 1/(1 — H). We will use this to obtain H 
from G, while G in turn is obtained from the two-variable generating function 

F(z,w) := J2J2 F ^=3> nwj - 

n>0 jGZ 

Finally, we may write F = 1/(1 — zg(w)) where 

g (w) := ^2b n w n 

is the generating function for X±. 

The following estimates are elementary. From the local central limit theorem |Dur04l 
Theorem (II.5.2)], as n — > oo, 

< ~ 1/2 ' (2 ' 5) 

Consequently, 

G(z) ~ ^L(i-.)-V2 (2 . 6) 
as z t 1- To see this, let e denote 1 — z and compute 

= y _^ n -l/2 e -n £ (l+o(l)) 
n>0 

= _^ e V2 y (ne )-i/2 e -Mi+ (D) 

v n>0 

~ -J— e- 1 / 2 r x- x l 2 e~ x = -L(l-z)-V2 



using dominated convergence at the first approximation. 
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Finally, for H = 1 — 1/G, we have the estimate 

l-H(z) ~ VzVil-z) 1 / 2 (2.7) 

as z | 1. The proof of Theorem II . 21 rests on these estimates and on the following Tauberian 
theorem of |F()9()j : 

Theorem 2.1 (Flajolet-Odlyzko (1990)) Say that a region R is a Camembert region 7 
if it is of the form R t := {\z\ < 1 + e and | arg(z — 1)| > ir/2 — e}. If a function H is 
analytic in a Camembert region and H(z) ~ C(l — z)~ a near z = 1, then its coefficients a n 
satisfy 

C a-l 

7(") 

□ 



Proof of Theorem 11.21 Let C denote the unit circle. For fixed w G C, the function 
F(z, w) is analytic as z varies over the open unit disk; this follows from absolute convergence 
of the power series. It also follows that F(z, w) is continuous in (z, w) on the product 
0, := D x C of the open unit disk with the unit circle. For fixed w G C, the Cauchy integral 
formula gives 

z n F(S n = 0) = z n I VP(5 n =j'K-. 

J w 

J 

We may sum this over n and exchange the sum and integral as long as \z\ < 1, leading to 

SW-r/^*, (2.8) 

where 7 goes around the unit circle, counterclockwise 8 . 

Suppose we can show G to be analytic in a Camembert region. It follows that H = 
1 — 1/G is meromorphic in a Camembert region, and since a function whose coefficients go to 
zero may have no poles in the closed unit disk, it follows that H is analytic in a Camembert 
region. The conclusion of the theorem will then follow from (|2.7|) and Flajolet-Odlyzko. 

7 Named, by French mathematicians, for its shape. 

8 This integral formula is used in |HK71| to derive a result (attributed to Fur67| by |Sta9 9 ) implying in 
this case that G is algebraic whenever g is rational. In fact, in the case where Xi has finite support, one 
may use this implication at the next step to avoid having to examine the power series expansion of g. 
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Claim: There is a Camembert region R such that z ^ \/g(w) for any z £ R 
and w on the unit circle, C. Consequently, F(z, w) has an extension to R x C 
that is analytic in z and continuous in (z,w). 

Proof: The facts that X\ is a probability distribution, has mean zero, and 
has variance V translate into three facts about g, namely, g{l) = 1,</(1) = 
0,</'(l) = V. Immediately, we then have 

Hence arg(l/ g(w)) -^Oasw->linC and there is an 5 > such that for 
arg(z — 1) > 5 and \8\ < 5, z / l/g(e l9 ). For 1 ^ w € C, aperiodicity of X\ 
implies |5(w)| < 1- Let e be the minimum of 5 and the values |<?(e 46, )| -1 — 1 on 
|#| G [— 7r,7r] \ (—5,5). Then z ^ l/g(w) on the Camembert region R(e). □ 



Finishing the proof of Theorem 11.21 we let R be as in the conclusion of the lemma and 
observe that for any closed loop (3 in R, we may exchange the order of integration in the 
representation of G in ()2.8|) to get 

G(z)dz = l [±^ld W dz 

p 7/3 7 7 2? " w 



1 f dw f ^, 
/ — / F(z,w)dz 





= 0. 

By Morera's theorem, G is analytic in R, completing the proof of Theorem 11.21 □ 



3 Probabilistic proof 

Let p n (x,y) := ^x(S n = y) and q n (x,y) := TP x (S n = y, Sj ^ VI < j < n - 1) denote 
probabilities for {S n = y} respectively with or without killing at the origin. These quantities 
are symmetric in the two arguments. Previously defined quantities are related to these by 
< = p n {0, 0) and a n = q n (0, 0). We let Q n := ¥ (S 3 ^ 0, VI < j < n) = EL+i «« be the 
tail sums of {a n }. 

The derivations of ()2.5|) . ()2.6() and ()2.7|) in the probability literature are via the gen- 
erating function analysis in the previous section. At this point, the methods part ways. 
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The probabilistic analysis derives a n from its tail sums, Q n . The reasonably well known 
estimate on Q n is 




This is proved analytically, not via extending a two-variable generating function to a 
Camembert region, but just from (|2.7|) , The key here is that the sequence {Q n } is mono- 
tone. According to a Tauberian theorem which may be found in |Fel71l Theorem XIII.5], 
the extra regularity, together with the behavior of its generating function H/ (1 — z) for real 
z | 1, implies ()3.9j) . 

Since a n+ \ = Q n — Q n +i, the conclusion of the theorem now follows if we can establish 
regularity of a n to the degree that 

a n+1 -a n = 0(n~ 5 / 2 ). (3.10) 

Note that we have now converted the task from one of finding the correct leading term into 
one of finding an upper bound to within a constant factor, which is a problem well suited to 
probabilistic analysis. To complete the regularity argument we need a couple of estimates 
on how rapidly p n (0, x) := Po(<Sn = x) can change with n. We will prove these at the end. 

Lemma 3.1 Under the assumptions of aperiodicity, zero mean and finite variance, there 
is a constant C such that 

\p n (0,x) -p n+ i(0,x)\ 
\p n (0,x) -p n+1 (0,x)\ 

Upper bounds for q n (0,x) are given by 

, . C(\x\ + 

g»(o,x) < - l n3/2 ' (3.13) 

q n (0,x) < -. (3.14) 
n 

We now prove (|3.1Uf) for n = 3m, the cases of 3m + 1 and 3m + 2 being identical. 
Breaking down according to location at times m and 2m we get 

a n = 53m (0,0) = ^ q m (0, x)q m (x, y)q m (y, 0) ■ 



< 



< 



n 



C 

3/2 



c 



(1 + x 2 )n l l 2 



(3.11) 
(3.12) 
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Write q m (x,y) = p m (x,y) — ¥ x (S m = y, 3j 6 [l,m— 1] : Sj = 0). Substituting this in the 
above equation gives 



m— 1 



a n = ^2^2q m (0,x)p m {x,y)q m {y,0) - ^2^2 (0,y)q m (y,0) 

x^O y^O k=l 

m 

^m+kPm+l—k 

x^Oy^O k=ly^0 

and hence 

\a n -a n+ i\ < ^2^2q m (0,x)\p m (x,y) -p m+1 (x,y)\q rn (y,0) 

x^O yj^O 

+ ^2 a 2m Pi (0, y)q m {0, y) 

+ ^2 ^2a m+k \p m+1 - k (o,y) -Pm-k(o,y)\q m {y,o) ■ 

k=l 

We must bound each of the three terms by 0(m~ 5//2 ). The second term is ci2 m a m = 
0(m~ z ) by (|3.13|) . Using (|3.11j) we see that the first term is 

O(m- 3 / 2 )^2q m (0,x)Y,q m (y,0) = 0(m- 3 / 2 )Q 2 m = 0(m~ 5 / 2 ) . 
x^O y^O 

The third term requires a little more care. We will show that 

|p fc+ i(0, y) - Pk (0, y)\q m (0, y) < ok' 1 / 2 [l + log (^)] m" 3 / 2 . (3.15) 



To show this, split into three ranges of values for y, namely \y\ < Vk, \f~k~ < \y\ < \fm 
and |y| > \fm. In the first range we use (|3,11|) and (|3.13|) with \y\ bounded above by k 1 ^ 2 
to see that the summand is 0(k~ 3 / 2 )0(k 1 / 2 m~ 3 / 2 ). There are k 1 / 2 summands, so the total 
sum is 

In the middle range, we use (|3.12|) and ()3.13j) to see that the summand is bounded 
by a constant multiple of k~ l l 2 \y\~ 2 \y\m~ 3 / 2 . Summing over y introduces the factor of 
(1/2) log(m/A;). For the third sum, use (|3. 12|) and (|3.14|) to see that the summand is 
0(k~ 1 ^ 2 \y\^ 2 m^ 1 ), so that summing over y > \fm gives 0(A;~ 1 / 2 m -3 / 2 ). This proves (|3.15|) . 
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Finally, summing (|3,15j) over k < m gives 0(m 5 / 2 ) which establishes (|3.1U|) . finishing 
the proof of Theorem 11,21 □ 

Proof of Lemma l3.1> The simplest of the inequalities is (|3.14|) . so we handle it first. Let 
n = 3m. The bound (|3.14|) follows immediately from 

q n (0,x) < y^ j q m (0,y)p2m(y,x) < a m supp 2m (y,x) = 0(m _1 ). 



To prove ()3.13[) . we decompose according to the position at time m and at time 2m, so 
that 

q n (0,x) = q m (0,y)q m (y,z)q m (z,x) 

< [Q m supq m (y,z))y2q m (z,x) 
= 0{m- l )Q m (x), 

where Q m (x) := F x (Sj / VI < j < m and we have used q m (z,x) = q m (x,z) to infer 
X] 2 qm(z, x) = Qm(x). Thus it suffices to show that 

Qmix) = Odl + lxDm- 1 / 2 ). (3.16) 



Observe that there is a constant c depending on the distribution of X\ but not on y such 
that the probability, call it p(y), of hitting y in at most y 2 steps starting from the origin is 
at least c (use the local central limit theorem to bound the expected number of visits to y 
within the first t 2 steps by from below by ci(l + \y\) and use the Green's function to bound 
the expected number of visits to y given at least one visit from above by C2(l + |y|))- By a 
last exit decomposition, we then have 

c < p(y) < £pj(0,0)7-(y) 

j<y 2 

where r(y) is the probability starting at the origin of hitting y before returning to the origin. 
Using pj(0, 0) = ©(j -1 / 2 ) and solving for r(y) gives 



r{y) = Q 



1 



1+1/ 
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But by (|3,9|) . decomposing according to the time y is first hit, 

— n- 1 ' 2 ~ Q n >r(y)Q n (y) 

7T 

and solving for Q n (y) proves (|3,16|) , 

The bounds on 5 n (x) := \p n (0,x) — p n+ i(0,x)\ are classical (though not all that well 
known) and are obtained by the same means as the local central limit theorem. Let 

<j){e) = g(iO) = Ee i6Xl 

be the characteristic function of X\, so that as we have seen, mean zero, finite variance and 
aperiodicity imply that 



1 



ve 1 



+ o(9 2 



near 1, while 



l-4>{9) < c6 2 
for all \9\ < it. The inversion formula gives 

0(0) n e~ ite d6 . 



We obtain from (|.3.19|) 



p n (0,x) 



S n (x) < 



1 

2^ 



(3.17) 
(3.18) 

(3.19) 



>(0) - l\d0. 



Given (|3.17|) and (|3.18j) . we see this is a saddle point integral with main contribution 
near 9 = 0. In particular, since 4>{0) < 1 - c(9 2 , we know that \(j){B)\ n < cexp(6n 3 / 4 ) for 
n -1 / 8 < \6\ < 7r and we may restrict the integrals to a suitable range such as \6\ < n -1 / 8 . 
Let y = Q^fn. From (j3~T7)) . 





n 

< ce~ ay2 










< c— , 






n 



whence S n (x) = 0(n 3//2 ). 



For (|3.12j) we integrate (|3.19|) twice by parts to get 

„2 /-7T 



Pn(0, x) = - — I [n(n - l)c(>'(e) 2 + n<P"(8)] 0(0)^ ! e — e d0 . 



(3.20) 
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The same truncation and change of variables, together with the estimate 



< c- 1 + |y| 



y 

fn J y'n 

give 5 n (x) = 0((1 + x 2 )n -1 / 2 ) and completes the proof of the lemma. □ 

Acknowledgement: The problems we address were suggested by a question posed by Dr. 
Amy Myers. 
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